On Predicting Software Development Effort using Machine Learning Techniques and Local Data
نویسندگان
چکیده
This paper analyses the accuracy of predictions for software development effort using various machine learning techniques. The main aim is to investigate the stability of these predictions by analyzing if particular techniques achieve a similar level of accuracy for different datasets. Two key assumptions are that (1) predictions are performed using local empirical data and (2) very little expert input is required. The study involves using 23 machine learning techniques with four publicly available datasets: COCOMO, Desharnais, Maxwell and QQDefects. The results show that the accuracy of predictions for each technique varies depending on the dataset used. With feature selection most techniques provide higher predictive accuracy and this accuracy is more stable across different datasets. The highest positive impact of feature selection on the accuracy has been observed for the K* technique, which has generated the most accurate predictions across all datasets.
منابع مشابه
Bridging the semantic gap for software effort estimation by hierarchical feature selection techniques
Software project management is one of the significant activates in the software development process. Software Development Effort Estimation (SDEE) is a challenging task in the software project management. SDEE is an old activity in computer industry from 1940s and has been reviewed several times. A SDEE model is appropriate if it provides the accuracy and confidence simultaneously before softwa...
متن کاملمروری بر روشهای تخمین هزینه نرمافزار مبتنی بر یادگیری ماشین
Software project management software is the most important activity in software development, because it contains the whole software development process, from beginning to end. Software cost estimation is a challenge task in the software project management. It is an old activity in computer industry from 1940s and has been developed many times. Effort, only covers part of the cost of a software ...
متن کاملPredicting software project effort: A grey relational analysis based method
The inherent uncertainty of the software development process presents particular challenges for software effort prediction. We need to systematically address missing data values, outlier detection, feature subset selection and the continuous evolution of predictions as the project unfolds, and all of this in the context of data-starvation and noisy data. However, in this paper, we particularly ...
متن کاملUsing Machine Learning to Predict Project Effort: Empirical Case Studies in Data-Starved Domains
Ideally, software engineering should be able to use machine learning to control or significantly decrease the costs associated with building software. In reality, there are very few examples of applying such applications early in the software life cycle. One reason for the scarcity of examples is the lack of empirical data in the software engineering discipline. This dilemma is quite evident wh...
متن کاملMachine Learning and Citizen Science: Opportunities and Challenges of Human-Computer Interaction
Background and Aim: In processing large data, scientists have to perform the tedious task of analyzing hefty bulk of data. Machine learning techniques are a potential solution to this problem. In citizen science, human and artificial intelligence may be unified to facilitate this effort. Considering the ambiguities in machine performance and management of user-generated data, this paper aims to...
متن کامل